# Billion-parameter ViT
Sapiens Pose 1b
Pose-Sapiens-1B is a high-resolution human pose estimation model based on the Vision Transformer architecture, pre-trained on 300 million 1024x1024 resolution human images, supporting 308 keypoint detections (body, face, hands, and feet).
Pose Estimation English
S
facebook
82
4
Sapiens Seg Foreground 1b Torchscript
Sapiens is a vision transformer model pre-trained on 300 million high-resolution human images, specifically designed for foreground person segmentation tasks.
Image Segmentation English
S
facebook
25
1
Featured Recommended AI Models